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Abstract 

In a balls-in-bins process with feedback, balls are sequentially thrown into bins so 
that the probability that a bin with n balls obtains the next ball is proportional to f(n) 
for some function /. A commonly studied case where there are two bins and f{n) = n p 
for p > 0, and our goal is to study the fine behavior of this process with two bins and a 
large initial number t of balls. Perhaps surprisingly, Brownian Motions are an essential 
part of both our proofs. 

For p > 1/2, it was known that with probability 1 one of the bins will lead the process 
at all large enough times. We show that if the first bin starts with t + X^/i balls (for 
constant A € R), the probability that it always or eventually leads has a non-trivial 
limit depending on A. 

For p < 1/2, it was known that with probability 1 the bins will alternate in leadership. 
We show, however, that if the initial fraction of balls in one of the bins is > 1/2, the 
time until it is overtaken by the remaining bin scales like 6(* 1+1 /( 1 - 2p )) for p < 1/2 
and exp(0 (t)) for p = 1/2. In fact, the overtaking time has a non-trivial distribution 
around the scaling factors, which we determine explicitly. 

Our proofs use a continuous-time embedding of the balls-in-bins process (due to Rubin) 
and a non-standard approximation of the process by Brownian Motion. The techniques 
presented also extend to more general functions /. 

1 Introduction 

1.1 The process 

If / : N — > (0, +oo) is a positive function, a balls-in-bins process with feedback function f a 
discrete-time Markov process with B bins, each one of which containing Ij(m) > balls at 
time m for each m 6 {0, 1,2,...} and i 6 {1, . . . , B}. Its evolution is as follows: at each time 
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m > 0, a ball is added to a bin i m , so that Ii m (m) = Ii m (m — 1) + 1 and Ij(m) = Ij(m — 1) 
for all % G {1, . . . , B}\{i m }, and the random choice of bin i m has distribution 

Pr(i m =i\{I j (m-l) : 1 < j < B}) = ~ * )} - - (1 < i < S), (1) 

Ej=i/( / i( m - !)) 

We will commonly refer to this recipe by saying that bin i receives a ball at time m (i.e. 
im = i) with probability proportional to f(I m (i)). 

Such processes 1 were introduced to the Discrete Mathematics community by Drinea, 
Frieze and Mitzenmacher jS], where they were motivated by economical problems of com- 
petition and mathematically related preferential attachment models for large networks 2 . 
That paper treats only the case where f(x) = x p for some parameter p > 0. In this case / 
is increasing, and therefore the rich get richer: the more balls a bin has, the more likely it is 
to receive the next ball. In economic terms, one could think of bins as competing products 
and balls as customers; in that case, the more popular a product is, the more likely it is to 
obtain a new customer. 

The main question addressed in that paper is whether this phenomenon ensures large 
advantages in the long run for some bin. The authors show that if p > 1, there almost 
surely exists one bin that gets all but a negligible fraction of the balls in the large-time 
limit; whereas for p < 1, the asymptotic fractions of balls in each bin even out with time. 
The p = 1 case is the classic Polya Urn model, for which it has been long known that the 
number of balls in each bin converges almost surely to a non-degenerate random variable, 
and thus the process has different regimes depending on the choice of parameter p. 

1.2 The three regimes 

However, a much stronger result is available. A paper by Khanin and Khanin [7] introduced 
what amounts to the same process as a model for neuron growth, and proved that if p > 1, 
there almost surely is some bin that gets all but finitely many balls, an event that we call 
monopoly. They also show that for 1/2 < p < 1, monopoly has probability 0, but there 
almost surely will be some bin which will lead the process from some finite time on (we 
call this eventual leadership), whereas this cannot happen if < p < 1/2. Therefore, the 
balls-in-bins process has three regimes of behavior corresponding to three ranges of p. 

In fact, the result of generalizes to any / with min xg N/(x) > 0. Consider the 
following events, which we call monopoly by bin i G [B], 

Mon, t = {3Af e N Vm > M Vj G [B] j^i^ I rn (j) = I M (j)} (2) 
= {3M G N Vm > M i m = i}; 
X A longer background discussion is available from the author's PhD thesis [10| . 

More specifically, the model introduced by Krapivsky and Redner and independently by Driena, 
Enachescu and Mitzemacher This model generalizes the Barabasi- Albert model, which is discussed in 
the survey 0. 
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and eventual leadership by bin i £ [B]: 

ELeadi = {3M e N Vm > M Vj € [B] j ^ i I m (j) < J m (*)}. (3) 

Clearly, ELeadj D Morij. A not-too-difficult extension of 7 (proven in |131 HUj ) says that 

Theorem 1 (From [71 1131 TlOj^ // {-fm}m=0 ^ s a balls-in-bins process with B bins and 
feedback function f = f(x) > c for some c > 0, then there are three mutually exclusive 
possibilities. 

L J2 n >i /( n ) _1 < +°°; in which case Pr (u|Li Mon*) = Pr (uf =1 ELead,) = 1 (we call 
this the monopolistic regime); 

2. J2 n >i f( n ) 1 = +°° but Y^ n >i fi 71 ) 2 < +°°; in which case Pr (uf =1 Morij) = but 
Pr (uf =1 ELeadj) = 1 (this is the eventual leadership regime,); 

3 - En>i f( n )~ 2 = in which case Pr (ug =1 Morij) = Pr (uf =1 ELead^) = (this is 

the almost-balanced regime ). 

This holds irrespective of the initial conditions of the process. 

Notice that the three cases of the Theorem applied to the fix) = x p family correspond to 
p > 1, 1/2 < p < 1 and < p < 1/2; in other words, this family of / has phase transitions 
at p = 1 and p = 1/2. We sketch a proof of this result in Section f3. 21 both for completeness 
and to give readers a better acquaintance with the techniques in the present paper. 

1.3 The present work 

This paper is part of a series by the present author in collaboration with Michael Mitzen- 
macher and Joel Spencer 1111 112j that attempts to quantify different aspects of the three 
qualitative regimes presented in Theorem ^ Our specific purpose in the present paper is 
to prove two not-quite-related results about these processes in different regimes, when the 
initial number of balls on both bins is large. What brings these two results is that both 
proofs use Brownian Motion in an unexpected and surprising way. 

Our first result is a scaling result for the eventual leadership and monopoly regimes. 
Suppose, for simplicity, that f{x) = x p with p > 1/2. Recall the definition of eventual 
leadership by bin i (J3J), and let Leadj be the event that bin 1 leads the process at all times: 

Lead,; = {Vm > Vj G [B] j^i^ I m (j) < ^m(0}- ( 4 ) 
If the initial number of bins is t S> 1 and Iq{V) ~ t/2, then Pr (Leadj) ~ and 
Pr (ELeadj) ~ 1 for i = 1,2. On the other hand, if /rj(l) is much larger than Iq{2), one 
think that Pr (Leadi) ,Pr (ELeadi) ~ 1. Our question is how large is large enough? That is 
to say, at what scale do these two probabilities grow from to 1? We show that the answer 
is in fact (v^), and give an exact asymptotic result. 
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Theorem 2 Let {Xt}teN C R /orm a sequence such that such that 



A 



lim A; G R exists 



and 




4p- 2 



€ N. 



Assume that for each t, {Im (j) '■ j £ [2],m. > 0} is a balls-in-bins process with two bins, 
feedback function f(x) = x v , 1/2 < p < +oo and initial conditions 



This theorem is an extension of a result by Mitzenmacher, Oliveira and Spencer [S], 
who showed a similar scaling for Moni when p > 1. That paper used Esseen's inequality 
for approximation by Gaussians together with a continuous-time embedding of the balls-in- 
bins process; we shall also use the latter device together with approximation by Brownian 
Motion, especially to estimate Prf x n-\ y n\\ (Mortj). Notice that as p \ 1/2 the scaling term 



y 4p_2 becomes bigger; i.e. near the p = 1/2 phase transition, it becomes harder to bias 
the process towards (eventual) leadership by either bin. 

The second result we prove is about the almost-balanced case. Suppose, again for 
simplicity, that there are two bins (B = 2) and f{x) = x p , < p < 1/2, . In this case 
Theorem n sa ys that for any initial conditions Io(l), Io(2) with /o(l) < -^o(2), there is a 
time m > such that I m (l) > I m {2). Call the first such time the overtaking time V. By 
the above, V < +oo, but we have no idea of the distribution of V, and thus we don't know 
how long the overtaking might take. We show that if the initial number of balls is large 
and bin 2 has a non-negligibly bigger fraction of the initial balls, then V can actually be 
quite large; moreover, it has an explicit asymptotic distribution. 

Theorem 3 Let Vt a be the overtaking time in a balls-in-bins process with feedback function 
f(x) = x p (withp £ (0, 1/2] constant) and initial conditions (\at \ , t— \at] ) for < a < 1/2. 




Then 




(6) 
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Then there exist random variables {Ut, a ,p}t€~N such that 



1-P _ a l-P]2 + 



(1-a) 



l-2p „2 



} 



l-2p ^+1=^ 




2(1 - a)t exp |4[1 - 

«/P = 5, 




2 v /a(l - a)] 



7^ 



t,a, 




(*+l) 



(7) 



mt/i probability tending to 1 as t 



+00, and 




where N is a standard Gaussian random variable. 

This means that V becomes larger and larger as p f 1/2, culminating with the exponen- 
tial behavior at the phase transition point p = 1/2. The economically-inclined might wish 
to deduce from this theorem that, under appropriate initial conditions, a product's leader- 
ship might last a long time even in markets with no propensity for breeding monopolies or 
"eternal leaders". 

1.4 Techniques and outline 

Our results in this paper are actually more general: they extend to a broader (though not 
entirely general) class of functions / and, in the case of Theorem H3 to more than two bins. 
All proofs below are done for this more general case and then specialized for f(x) = x p . 

Our proofs have their first two steps in common. The first step has been employed in 
113j and other works, and seems to have originated in Davis' work on reinforced random 
walks (!]. We shall embed the discrete-time process we are interested in into a continuous- 
time process built from exponentially distributed random variables, so that interarrival 
times at different bins are independent and have an explicit distribution, which is very 
helpful in calculations. We call this the exponential embedding of the process. 

The second technique we use is approximation by Brownian motion via Donsker's Imparl- 
ance Principle. While neither technique is novel, their conjunction in the way presented here 
yields surprising explicit results in the asymptotic regime, once the appropriate calculations 
are done. 

The remainder of the paper is organized as follows. We discuss preliminary material in 
Section |2 Section |3] rigorously introduces the exponential embedding process and discusses 
its key properties. In Section 0] we detail the assumptions we make on our feedback func- 
tions /, while also deriving some consequences of those assumptions. Section proves the 
general version of Theorem [2 whereas Section [7| contains the proof of the generalization of 
Theorem [3 Finally, Section El discusses extensions to our results and some open problems. 
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2 Preliminaries 

General notation. Throughout the paper, N = {1, 2, 3, ... } is the set of non-negative inte- 
gers, R + = [0, +oo) is the set of non-negative reals, and for any k S N\{0} [k] = {1, . . . , k}. 

Asymptotics. We use the standard O/o/Q/Q notation. The expressions "a n ~ b n as 
n — > n " and "a n <C b n as n — > n " mean that lim n __ +no (a n /6 n ) = 1 and ]hn n ^ no (a n /b n ) = 0, 
respectively. 

Balls-in-bins. Formally, a balls-in-bins process with feedback function / : N —* (0 + oo) 
and B G N bins is a discrete-time Markov chain {(Ji(m), . . . , /B(^))}m=o w ith state space 
N B and transition probabilities given in the Introduction (see (|T|)). We will usually refer to 
the index i m £ [B] as the bin that receives a ball at time m. For any B, if E is an event of the 
process and u € N B , Pr n (E) is the probability of E when the initial conditions are set to u. 

Exponential random variables. X = d exp(A) means that X is a random variable with 
exponential distribution with rate A > 0, meaning that X > and 

Pr(X >t) = e~ xt (t > 0). 

The shorthand exp(A) will also denote a generic random variable with that distribution. 
Some elementary but extremely useful properties of those random variables include 

1. Lack of memory. Let X = d exp(A) and Z > be independent from X. The distribu- 
tion of X — Z conditioned on X > Z is still equal to exp(A). 

2. Minimum property. Let {X{ = d exp(Aj)}™ 1 be independent. Then 



-^min = min Xi = exp(Ai + A 2 + . . . X r , 

Ki<m 



and for all 1 < i < m 

Pr (X t = X miQ ) = — (8) 

3. Multiplication property. If X = d exp(A) and r\ > is a fixed number, r\X = d exp(A/?7). 
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4. Moments and transforms. If X = d exp(A), r G N and i G R, 

ExpT] = (9) 

Ex [e«l = ( ^1 {t<X) (10) 
L J \ +oo (t>A) 

Weak convergence. X n Y means that the sequence {J^} of random variables 
converges weakly to Y as n — > +oo. 

Gaussians and cumulative distribution functions. Finally, we restate the definitions of 
$ and r in Theorem 



fo" /-max{A,0} 

T(A) = J- / e~ x2 l 2 dx. (12) 



If iV is a standard Gaussian random variable, V is the cdf (cumulative distribution 
function) of N and V is the cdf of \N\. 

3 The exponential embedding 
3.1 Definition and key properties 

Let / : N — > (0,+oo) be a function, B G N and (oi, . . . ,ae) G N b . We define below a 
continuous-time process with state space (N U {+cxd}) b and initial state (ai,...,ae) as 
follows. Consider a set {X(i, j) : i G [B], j G N} of independent random variables, with 
X(i,j) = d exp(/(j)) for all (i,j) G [B] x N, and define 



Ni(t) = sup < 



n-l 



gN: Vl(i,i)<t (iG[ J B],tGR+ = [0,+oo)), (13) 



j=<H 



where by definition ^2j =i (. • • ) = if i > k. Thus iVj(O) = at for each i G [-B], and one 
could well have iVj(T) = +oo for some finite time T (indeed, that will happen for our 
cases of interest); but in any case, the above defines a continuous-time stochastic process, 
and in fact the {Ni(-)}f =1 processes are independent. Each one of this processes is said to 
correspond to bin i, and each one of the times 

X(i, ai),X(i, ai) + X(i, ai + 1), X(i, a,,) + X(i, + 1) + X(i, aj + 2), . . . 
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is said to be an arrival time at bin i. As in the balls-in-bins process, we imagine that each 
arrival correspond to a ball being placed in bin i. 

In fact, we claim that this process is related as follows to the balls-in-bins process with 
feedback function /, B bins and initial conditions (oj, . . . , «b). 

Theorem 4 (Proven in [4J [71 1131 11UL Let the {-^(Olie^B] process be defined as above. 

One can order the arrival times of the B bins in increasing order ( up to their first accu- 
mulation point, if they do accumulate) so that T\ <T% < . . . is the resulting sequence. The 
distribution of 

{I m = (N 1 (T m ),N 2 (T m ), N B (T m ))} m&N 

is the same as that of a balls-in-bins process with feedback function f and initial conditions 
(ai,a 2 , . . . ,a B ). 

One can prove this result 3 as follows. First, notice that the first arrival time T\ is 
the minimum of X(j,aj), (1 < j < B). By the minimum property presented above, 
the probability that bin i is the one at which the arrival happens is like the first arrival 
probability in the corresponding balls-in-bins process with feedback: 

Pr ( X(i,aA = min X(j, aj ) J = — . (14) 

More generally, let t € R + and condition on (Ni(t))^ =1 = {bi)f =l S N B , with bi > ai for 
each i (in which case the process has not blown up). This amounts to conditioning on 

bi—l bi 



Oi — l Oi 

V» G [B] Yl hi "> - 1 < Yl bi 

j=cti j=a,i 



From the lack of memory property of exponentials, one can deduce that the first arrival 
after time t at a given bin i will happen at a exp(/(6j))-distributed time, independently for 
different bins. This almost takes us back to the situation of (|14jl. with bi replacing a^, and 
we can similarly deduce that bin i gets the next ball with the desired probability, 

f(h) 



Ef=i /(&,■) 



3 The exact attribution of this result is somewhat confusing. Ref. |7j cites the work of Davis |1] on 
reinforced random walks, where it is in turn attributed to Rubin. 
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3.2 On the three regimes 



Let us now briefly point out some of the key steps in the proof of Theorem ^ via the 
exponential embedding, in the case B = 2. Reading this sketch might help the reader to 
become acquainted with an important part of our methods. 

We use the same notation and random variables introduced above. 

Assume we start the process from state (x, y) £ N 2 . First, we note that 



Ex 



+00 



+00 J 



J=X 



fur 



(15) 



hence if the RHS is finite, -^(l>i) < +°° almost surely, and similarly for Yl^^y -^(2, j) ■ 

Moreover, the two random series are independent, and neither has point-masses in their 
distribution. Therefore, with probability 1, 



+00 



+00 



+00 



either ^X(l,j) < ^X(2,j) or ^X(l,j) > ^X(2,j) 

j=x j=y j=x j=y 

If the first alternative holds, there exists a finite M > y such that 

+00 Af-l 



(16) 



J=X 



Now notice that the sequence {T nfc 
ball arrival times {T n } ng N, and at those times 

x+k Af-l 

Vfc e N T„ fc = ^I(lj) < E ^(2,j) 



Eji* -^(l>i)}feeN is an infinite subsequence of the 



In k {2)<M. 



Since {I m (2)} m is an increasing sequence, this means that I m (2) < M for all m G N; that 
is to say, bin 1 nuts/; achieve monopoly. On the other hand, if the second alternative in (|16l) 
holds, the same argument shows that bin 2 must achieve monopoly. Thus the condition (1151) 
implies that with probability 1, one of the two bins achieves monopoly. It is not too hard to 
prove that if (fT5)) does not hold, then almost surely J2^^X(l,j) = J2j^.X(2,j) = +00; 
in fact, it suffices to show that, for some p > 



Ex 



x+k 

exp(- P J2x(l,j)) 



J=X 



x+k 



Y[ Ex [exp(-pX(l,j))] as k -> +00 
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and similarly for Yuj=*y X(2, j)- In this case one can show, by reversing the above reasoning, 
with probability 1 no bin will achieve monopoly. 
Now assume that x > y (for simplicity) and 

+00 

^m <+ao - <17) 

In this case, even if £ . f(j) = +00, the series 

+00 

J2(X(l,j)-X(2J)) 

j=x 

is made of independent, centered random variables whose variances satisfy 

+00 +00 „ 

Var ((X(l,j) - X(2,j))) = £ 

j=x j=l J{J) 

Hence Kolmogorov's Three Series Theorem implies that ^t^(X(l,j) — X(2,j)) converges. 
Following the reasoning developed above, we deduce that almost surely 

+00 x— 1 +00 X — 1 

either j) - X(2, j)) - £ ^(2,j) < or j) - X(2, j)) - ^ X(2,j) > 0. 

j=x j=y j=x j=y 

In the first case, for all large enough M 

M-i M-l 

j=x j=y 

and one can check that this means that for all large enough M, bin 1 reaches level M 
before bin 2 does (in the embedded and continuous-time processes): that is, bin 1 achieves 
eventual leadership. Otherwise, if £j^(^(l,i) -X(2,j)) X(2,j) > 0, bin 2 is the 

one that achieves eventual leadership. In either case, what we have discussed up to now 
proves items 1. and 2. of Theorem ^ 
Finally, if 

+00 _. 

g/0F = +O0 ' <18) 



then for any x, as k — > +00 

x+k ^ 1 ^ / _l 



/ \ 3/2 

X + fC 1 X + K / X + K 



_ /(j)3 " ,nin ; ; f(j) ^ /(j)2 \~ x f(J) 2 
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Checking the moments of the X(i,j)'s and using the results in Section [5.21 shows that the 
sums Y^j=y X(2, j) (k 6 N) are in the domain of attraction of Brownian 

Motion for any x and y. This implies that there is a sequence of random numbers M\ < 
M2 < M3 < . . . and a constant < a < 1 such that for all n £ N 



a<Pr £l(l,j)<£l(2, 



j=y 



{k 1 ,...,k n - 1 }U{X(l,£),X(2J) :£<k n ^} < 1- 



o. 



This implies that both £^X(l,j) < £^X(2,j) and E]l x X(l,j) > £^,X(2,j) 
must occur infinitely often almost surely. In this case, there are infinitely many k for which 
bin 1 reaches level k before bin 2 does, and vice-versa. It follows that (J18|) implies that 
with probability 1 neither bin will achieve eventual leadership, and this proves 3. and the 
theorem. 



Remark 1 Assume that bin 1 achieves monopoly. Then all arrivals of the continuous-time 
process at bin 2 after time 'Y1^= X X{1, j) do not actually happen in the embedded discrete- 
time process {I m = (I m (l), 7 m (2))}. We call these "ghost events" a fictitious continuation 
of our process. This very useful device is akin to the continuation of a Galton-Watson 
process beyond its extinction time (see e.g. 'BJ) and is equally useful in calculations and 
proofs. 



4 Assumptions on feedback functions 

The purpose of this rather technical section is two- fold. First, we spell out the technical 
assumptions on the feedback function / that we need in our proofs. Nothing seems to 
actually require these assumptions, but they facilitate certain estimates that we employ in 
the proofs. 

Some readers might wish to skip the proofs in this section on a first reading. 



4.1 Valid feedback functions 

The feedback functions we allow in our results satisfy the following definition. 

Definition 1 A function f : N — > (0,+oo) with /(l) = l 4 is said to be a valid feedback 
function if it can be extended to a piecewise C 1 function g : R + U {0} — ► (0, +00) with the 
following property: if (In g(-))' is the right-derivative of In g, and h(x) = x(\ng(x))' (for 

ieR + u {0} 

1. liminf^+oo h(x) = h min > 0; 

4 The requirement that /(l) = 1 is just a normalization condition, as it does not change the process. 
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2. 



lim 



X 



l±h{x) = 0; 



3. 



there exist C > and xq G R + such that for all e G (0, 1) and all x > xq 



(19) 



X<t<X 1 + e 



// m addition h m i n > 1/2, i/ien we say that f is ELM (ELM stands for "eventual 
leadership or monopoly"). If on the other hand h(x) < 1/2 for all large enough x, we 
say f is AB ("almost-balanced"). 

With slight abuse of notation, we will always assume that f is defined over R + U {0} and 
is piecewise C . We will also call h the characteristic exponent of f. 

Functions with exponential growth (such as f(x) = 2 X ) or with oscillations fail to satisfy 
Definition ^ On the other hand, requiring that / be increasing seems natural, and the 
smoothness assumption still leaves us with plenty of interesting examples of feedback func- 
tions; some examples are given in Tabled The "canonical case" where f(x) = x v (x > 1) 
explains the terminology for the characteristic exponent: in that case, h(x) = p for all 
x > 1. 

4.2 Consequences of the definition 

Let us now define the quantity 



for some / : N — > (0,+oo), and also let S r (n) = S r (n, +oo). If f(x) = x p , then for 
m — n, n 3> 1 a simple shows that 



The main content of the following lemmata (the first one proven in jllj ) is that a similar 
result holds for any valid /, if p is replaced by the characteristic exponent h. In particular, 
any valid / satisfies the monopoly condition in Theorem ^ These lemmas are used in the 
two main proofs in the paper. 

Lemma 1 ([llj) Assume that r is an integer and f is a valid feedback function with char- 
acteristic exponent h satisfying h m i n > 1/r. Define 

, ^ f + °° dx , 




(20) 





12 



Then, as n — > +cx) 

Ti 

S r (n) ~ M r (n) ~ 

[rn{n) — l)j{ny 

5 Approximation by Brownian Motion 

5.1 The Invariance Principle — setup 

This is the last section in which technical preliminaries are discussed. In it, we review a 
form of Donsker's Invariance Principle that shows that under suitable normalization, "nice" 
partial sums of random variables are close to Brownian Motion. All results in this section 
are quite standard and can be found in many books on Brownian Motion, e.g. [3] 

Consider the vector space C = C([0,1],R) of all real- valued continuous functions on 
the unit interval, with the sup norm 

||<K0IUp= sup \<p{ s )\, <K-)e a 

0<s<l 

This gives C a metric and a topology, and from now on we shall think of C as a measurable 
space with the Borel cr-field. Brownian Motion is simply a probability measure on this 
measurable space, or rather a random variable B{-) taking values on C, whose defining 
properties are: 

• Pr(B(0) = 0) = 1; 

• for all < So < si < • • • < Sjt < 1, the random variables 

B( Si ) - S( 8i _!) ^ ' 



y/sl s i— 1 



i=l 



are i.i.d. standard Gaussians. 
We will also use the following distributional equalities below: 

max B(s), — min B(t) = d \N\ where iV is standard Gaussian; (21) 

0<s<l 0<<<1 

B( ) — B'( ) 

if B'(-) is an independent copy of B, { '' V} = d B. (22) 

v2 

Now consider a "triangular sequence" {^ n ,t}teN,i<n<Af t of independent, 0-mean, square- 
integrable random variables,. Letting 

(k \ k 

E = E Var fe'.*) ( k G [ M *D and <t = 0, (23) 
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we define a random element Sj(-) (t G N) of C as follows. 



2 



V^ fc 0) C J_ I S a k(t),m \ t 

St(a) = ^ 1 ' — - V , (24) 

a M t ,t 

where s G [0, 1] and (25) 

fc(s) = maxjfc G [M*l U {0} : -^L < t\ . (26) 
I °M t ,t J 

Thus Sti^k t/ a Mt,t) * s ^ ne sum °^ * ne ^ nrs * 0.*' s ' divided by a normalizing factor; and 
for s G [c r kt/ (7 M t ti a k+it/ (7 M t t]> "*( s ) is defined by linear interpolation of the values of 
^ti&t M t / a t M t ) an d ^t( a k+i M t l°t M t )- One can cnec k that this is indeed a measurable 
element of C. 

The Invariance Principle states that if the sequence {£n,M t } satisfies certain conditions, 
the distribution of the Hj(-)'s converges weakly to a standard Brownian motion B(-). What 
this means is that if A C C is measurable with boundary dA and Pr (B(-) G dA) = 0, then 
Pr (Hj(-) G A) — > Pr (£?(•) G A) as i — > +oo. A sufficient condition for this is given by 

Theorem 5 (Special case of Donsker's Invariance Principle) If 



M * nt .i3i 

— > as t — > +oo, 



(7 



3/2 



i/ien i/ie sequence Hj(-) converges weakly to £>(•)• 



5.2 Application to the continuous-time process 

Our typical application of this invariance principle will be to the random variables in 
the exponential embedding. In the notation of the Section |31 let G [B] be fixed, 
{xt}t, {M t }t C N be sequences, and consider the triangular array of random variables 

{U,t = X(i, xt + n-l)- X(i', x t + n- l)} meNjl < n < Mt • (27) 

In this = 2S2(xt,xt + Mt) and the condition in Theorem [21 can be seen to be 

equivalent to 

S 3 (xt,x t + Mt) _ 
m^(S 2 (x u x t + M t ))^- U - [2 *> 

That is, equation Q28|) is the only condition we have to check in order to apply the Invariance 
Principle to the terms in (|27|). Notice also (by a simple limiting argument) that if ^(l) < 
+oo, we can also take M t = +oo in the above. 
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6 The scaling result for leadership 



6.1 The general statement 

Recalling Section 14. H let / be a ELM function. There exists an xq such that h{x) 
xf'{x)/f{x) > 1/2 for all x > xq, which means that for such large x 



^ ) = V4^F2 ( ^ 0) (29) 

is a well-defined, positive function. Our generalization of Theorem|2]shows that the quantity 
qo(t) plays the same role that as the map t i— ► -y/i/(4p — 2) in the specific case f(x) = x p , 
p > 1/2. That is, in order to bias a balls in bins process started with t balls towards 
leadership by a given bin, the difference between the initial numbers of balls should be 

e («,(*))• 

Theorem 6 Let f be a ELM function and define qo as above. Let A £ R be a constant, 
and assume that q = q{n) (n £ N) is such that 

• t/2 ± A q(n) £ N for all n £ N; 

• <z( n ) ~ Qo(n) f or n S> 1. 

7Vo?« consider the 2-bin balls-in-bins process started from initial state 

t . , s t 



Then 



(x(t),y(t))=[- + Xq(t),--Xq(t)) £ N 2 . 



Jim^Pr^)^)) (ELeadx) = $(A), (30) 



lim Pr( x( t) )2/(t )) (Leadi) = T(A). (31) 



Table ^ presents estimates of Qo( re ) f° r n large, for several choices of feedback functions 
in the ELM regime. In particular, the case f{x) = x p (p > 1/2) of the Theorem implies 
Theorem |2I The remainder of this section contains the proof of the general result. 

Proof: [of Theorem^] We start by discussing how one can write the event ELeadi and lead 
in terms of the exponential embedding. We only prove the result for A > 0: the case A = 
is a simple extension, and the case A < reduces to the one we discuss below. 

Let (x, y) £ N B be the (for the time being arbitrary) initial conditions with x > y. The 
event ELeadi holds whenever bin 1 reaches reach level M before bin 2 does for all large 
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/(*) = 


h{x) ~ 


q (x) ~ 


conditions 


X Pln q (x + e - 1) 


p 


\/ 4p-2 


p > 1/2, g£R 




(a + l)qln a {x) 


V 4(a+l) g ln Q (x) 


p > 1/2, g,a > 


e x 


px p 


1-P 
2^ 


< p < 1/4 



Table 1: Some ELM feedback functions and their corresponding h and qg. The last condition 
describes the conditions on the parameters p, q, a under which each / is indeed ELM. 

enough M. This requires that the time it takes for bin 1 to reach level M in the continuous- 
time process is smaller than the corresponding time for bin 2. In the exponential embedding, 
this corresponds to 

M M 

3M VM > M Y^X(l,j) < ]TX(2, j). 

j=x j=y 

The above event can be rewritten as 

M x-l 

3M VM > M ^2(X(l,j) - X(2,j)) < ]Tx(l, j). 
j=x j=y 

As noted in the proof sketch for Theorem^™ Section[TU ^2jL x {X(l,j)— X(2,j)) converges 
as M — ► +oo. If follows that, except for a null event, the above holds if and only if 

+oo x—1 

]T(X(l,j)-X(2,j))<^X(2,j). 

j=x j=y 

Thus we deduce that 

(+oo x—1 \ 

Y,(X(l,j) - X(2,j)) <Y,X(2,j)\ . (32) 
j=x j=y j 

What about the probability of Leadi? Using the above notation, Leadi holds if for all 
M > x bin 1 reaches level M before bin 2 does. That corresponds to 

M M 

VM>x J2x(l,j)<"£,X(2,j), 

j=x j=y 

or 

M x-l 
VM>x, t ^(X(l,j)-X(2,j)) <Y J X{2,j). 

j=x j=y 
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It follows that 

M x-1 



Pr {Xiy) (LeadO = Pr sup ^(X(l, j) - X(2,j)) < J^X(2, j) . (33) 
V' •> j=y J 

Recall now the choice x = x(t) = t/2 + Xq(t) and y = y(t) = t/2 — Ag(t), where 



As discussed in Section 15.21 { A(l, x(t) + n — 1) — X(2, x(i) + n — l)}teN,neN is a doubly- 
infinite array of centered, square-integrable random variables with 

t 

a n,t = Var W 1 ' x ^ + n - 1) - Z(2, x(t) + n - 1)) = 2S' 2 (x(t), x{t)+n) (1 < n < +oo). 
i=i 

One can construct a random continuous path H t (-) defined by setting 
'S 2 (x(t),x(t)+n)\ _ Y,f^ n ~ X m,3)-X{2,j) 



, ,(l<n<+oo) (35) 

S2(x(t)) J y / 2S 2 Jx) 'V - - ) K ) 

and completing the remaining values by linear interpolation. As discussed in Section 15.21 
the fact that we have infinite terms here poses no problems. Moreover, it is easy to check 
that 

1 M 

sup ~ t (s) = -==sup^X(l,j)-X(2,j), (36) 

0<s<l y , 2S2(X) M>x ~^ 



j=x 

-oo 



J=X 

Thus one can rewrite 



1 -r^j 

s,(1) " 7HM Ex(u) " x(2J) ^ (37) 



Pr^CEl-dO = Pr (W) < ^fg^) . (38) 

Pr (8|S) (Leadx) = Pr sup E t ( S ) < ^f=^ • (39) 

\o<s<i ^/252(x) y 

Suppose we show that as i — > +oo 

"*(') — >1 " a standard Brownian Motion .£?(•), (40) 



E-=^(2,i) 



■>3=y 



V2S2[x) 



A. (41) 
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Since Y^j=y is independent of Sj(-), this means that 

(H t (-),E X ( 2 ^')) - 10 (S(0,A), 

which implies 

Pr (Xiy) (ELeadx) - Pr(B(l)<A), (42) 

Pr/,^(Leadi) -> Pr f sup B(s) < A I . (43) 
^ ; Vo< s <i / 

These probabilities can be evaluated via standard formulae for Brownian motion in Sec- 
tion yielding the final result. We thus concentrate on proving equations (|1U)) and 1)41(1 . 

Proof of gZJ). By SectionO it suffices to show that S 3 (x(t)) <C S 2 (x(t)) 3 / 2 . But this 
follows directly from the formulae in Lemma ^ the fact that x = x(t) — > +oo, and the 
assumption that /i(x) <C \/^ : 

/ \ 3/2 

S3(l '» ~ (3Mx) - !)/(») W) - !)/»(,) ) = S2(al)3/2 - 



Proof of 14 Let us first establish a few facts about /, x, y and q. 

1. For i > 1, g(i) ~ f{t/2)^J S 2 {t/2)/2 = 0(t). Indeed, for f large, Lemma Q] implies 
that S 2 {t/2) ~ t/{4h{t/2) - 2)/(t/2) 2 , and, because of (ITTT1) in D efinition □ /i(t/2) ~ 

. Moreover, since liminf n ^ +00 h(n) > 1/2, g(i) = y/t/Q (1) = O (v^)- 

2. For t > 1, f{x) = f{t/2 + Ag(t)) ~ /(t/2). In this case 



In 



/(*) 



-Ag(t) 



(In /(«))' d« 



(44) 



-A 9 (t) 



/i(n) 



< 



sup \h(u)\ In ( 1 + 

|<u<|+A(?(t) 



2q(t) 



Now notice that equation 1)19)1 implies that the sup of /i(ti) above is ~ h(t/2) ~ /i(t). 
Moreover, g(t)/t = y/l/t(2h(t) - 1) < 1, since lim inf n^+oo ft,(n) > 1/2. Therefore, 



In 



h{t)q(t) 



O 



1 

7* 



o(l). 



(45) 
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3. For t > 1, f(y) ~ f(t/2). The proof is almost identical to the one above. 

4. For t S> 1 and r > 2, S r (x), S r (y) ~ S r (t/2). Indeed, because = 0(\/^), 

y —* +oo„ and the formulae in Lemma^apply. Thus ^(x) ~ x/[(r/i(x) — l)/(x) 2 ], 
and by 1. and 3., x ~ i/2, /i(x) ~ h(t/2) and /(x) ~ f(t/2), which implies S^x) ~ 
(t/2)/[(rh(t/2) — l)/(i/2) 2 ] ~ S r (t/2). The same argument proves the desired result 
for S r (y). 

We now apply the estimates to the problem at hand. For t large enough (so that x and 
y are also large), we can ensure that / is increasing on [y, x], so that 



x-y 



< Ex 



x-l 
i=y 



x—1 ^ 

Tf. 



< 



x-y 



j=y 



By items 2. and 3. above and the definition of x,y, this implies that 

x-l 



Ex 



Similarly, one can show that 



i=y 



x-l 



2Xq(t) 



f{t/2) 



Var |^I(2, 

J=y 



2Xq(t) 

WW 



(t » 1). 



Therefore, using 4., 

x-l 

Ex 



J=v 

By Chebyshev's Inequality, it follows that 



x-l 



> (l-o(l))2A?(t)Var [ ^X(2,£) ] (t > 1). 

i=y 



Zi=yX(2,£) 

/ 2Ag(t) \ 



1 as i — > +oo. 



Finally, since 



we have 



2f 



Mt) / 

/(t/2) ~ V ( 2h (t) - mm? 



EtyX(2,l) 

\^2S^(x) 
which is the desired result. □ 



1 as t — * +oo, 



(46) 



(47) 



(48) 
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7 The almost balanced regime 



This section proves our result on the overtaking time, a generalization of Theorem |31 To 
recapitulate: Theorem ^ tells us that, when the feedback function / satisfies 

+00 

g/(ip = +00 ' (49) 

each of the two bins will be the one with more balls infinitely many times. Our main 
interest in this chapter will be in determining how long it takes for bin 1 to have more balls 
than bin 2, given that the latter bin has more balls at the start. More specifically, assume 
the process (h(-), h{')) is started from state 

( \at] , t - \at] ) , t > 1 and a G (0, 1 /2) fixed. (50) 

As in the introduction, let V be the overtaking time of the process: that is the first time 
when bin 1 has more balls than bin 2. 

V = mm{v G N:7i(«) > h(v)}. (51) 

Under condition (|49|). this min exists and is finite with probability 1. We will be interested 
in describing the asymptotic distribution of V. 

To express our main result, let us introduce two mappings. 



F tta : ((l-a)t,+oo) -» R+ 

r(l-a)t dx /c;o^ 

^ Jet TT^) , (OZ) 

I ru dx 
V J(l-a)t J(xY 

Gt a = the inverse of F t>a . (53) 



u 



Notice that lim^^^^^ Ft,a(v) = +00 and 

r{l-a)t dx 

lim ^(ti) = ^ fix) = 

«^+oo / r+00 dx 

because (as a consequence of f(j)~ 2 = +00) the denominator in the RHS is infinite. 

Thus Ft a is a monotone-decreasing function whose range is R + , and Gt, a is not only 
well-defined, but monotone-decreasing as well. 
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Theorem 7 Assume that f is a A.B function (cf. Definition fTjj . and define Ft,ai as 
above. Let Vt, a be the random variable V defined above, conditioned on the initial state 
{\ai\,t — \at\) of the balls-in-bins process. Then, as t — > +00, 

VASR+, lim Pr(t$, a >2G t , a (A)-(* + l)) = r(A), (54) 
(where T(-) is defined as in Theorem^), or equivalently, 

(Y^±l±ll) ^ m , (55) 



for a standard Gaussian random variable N. The latter expression makes sense because 
V t , a > (1 - a)t, so 

V t , a + (t+l) / a\ 

> 1-- U> (l-a)t. 



2 ~ V 2 

This result is quite general, but applying it to a specific situation requires a calculation. 
We do this for the case f(x) = x p below, and then prove Theorem below. 

7.1 Proof of the special case 
Claim 1 Theorem^ implies Theorem^ 

Proof: One way of interpreting Theorem in the f(x) = x p case is by saying that 

Vt ta = 2Gt, a {Ut,a,p) — (t + 1) with probability — > 1, 

where Ut ap — > w \N\ as t — ► +00. Thus the Corollary follows from providing a formula for 
Gta- We will first assume that < p < 1/2, in which case 

e( * ,Q> s L m i <56) 

= [(1 - a) 1 '' - a l -']-f-^, (57) 

1 — p 



and for all u > (1 — a)t 



„i-2p _ [(1 _ a )t]i-2p 



/(M ' a) s 7(^75? (58) 



< 59 > 
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Then, for u > (1 — a)t, 



*U«) = 7#= , (60) 
\/f{u,t,a) 



[(l- a) i-P- a i-P]g 

jl-2p_[(l_ Q ,)f]l-2F 
l-2p 



(61) 



To compute Gt jQ ,(A) for some A G R + , we must solve the equation Ft ta (Gt, a (\)) = A, which 
corresponds to 

GU^~ 2P = [(1 " «)^" 2P + {1 ^ El {[(1 " «)^ P " • (62) 



Therefore, 



and the result for < p < 1/2 follows. 

For p = 1/2, the above formula for e(t, a) still applies, but 



and thus 



f(u,t,a) = / —- (64) 

J(l-a)t J\ x ) 

= In tt^t, (65) 
(1 — a)t 



G t)Q (A) = (l- a )texp<!^^|> (0G) 



A 2 



(1 - a) t exp |4[1 - 2y/a(l-a)] ^ j 



This finishes the proof. □ 



(67) 



7.2 Proof of Theorem [3 

Proof: Define the overtaking number Nt <a to be Ii(Vt ja ), i.e. the number I\{v) of balls in 
bin 1 at the first time v when Ii(v) > hiy), under initial conditions (|~at],t — [at]). It 
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follows from this definition that at time v' = Vt, a — 1, It(v') = l2( v ') = Nt a — 1. Since 
ii(0) + hiO) = t, this means that 

V t , a -l + t = h(v') + I 2 (v') 

= total # of balls at time v' 

= 2iV t>Q -2 

=*^, a = 2iV t , a -(t + l). (68) 

Thus results about the distribution of Nf a translate immediately into results about Vt a . 
Since Nt a is easier to analyze via our techniques, we shall spend most of our time considering 
this quantity, returning to the more significant Vt a at the end of the proof. 

We begin by showing that, in terms of the exponential embedding random variables, 

{m-i y-l 1 

sup y2(x(2j)-x(i,j))<y2x(i,i)\ , (69) 
y<m<M-l ^ f X J 

where x = \at\ (respectively y = t — \at\) is the initial number of balls in bin 1 (resp. 2). 
Indeed, Nf a > M occurs if and only if for all y < m < M — 1, the time it takes for bin 2 to 
receive its mth ball (which is Y^j=y -X"(2, j)) is smaller than or equal to the time it takes 
for bin 1 to receive its mth balls (which is Yll^~x (!,£))■ Symbolically, 



VMeN {N t , a >M} = < 



m—X m—l 



Wy < m < M- 1, X ( 2 >i) ^ E ) ' 



3=v 



from which (|69|) follows. 

We now wish to estimate the probability of the event at the RHS of (j69j) . We begin by 
looking at 

m— 1 

sup T(X(2,j)-X(l,j)). (70) 

y< m <M-l ^ 

In particular, let us assume that 

M = [G t , a (X)\ (71) 

for some A G R + , so that 

{N t>a >M} = {N t>a > G t>a (X)}. (72) 
We wish to apply the Invariance Principle to the random variables 

{U,t = X(2,y(t) +n - 1) -X(l,y(i) + n - l)}teN,l<n<Af-j/(t)-i- 
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Note that 



u 2 n>t = Var (X(2, y(t) + n - 1) - X(l, y(t) + n - 1)) = 2S 2 (y(t), y(t) + 

3=1 



n). 



Following Section f5. 21 we build the random path Hf(-) that linearly interpolates the values 



„ fS 2 (y(t),y(t)+n)\ _ ELi X(2, y (t) + j - 1) - X(l, y(t) + j - 1) 



S 2 (y(t),M -1) J ^/2S 2 (y,M-l) 
As in the previous proof, we have 

m— 1 



1 < n < M-y(t). 



sup H t (s) 

0<s<l 



sup,< m < A/ _! gg X(2, y(t) X(l, y(t) + j - 1) 

^25 2 (y,M-l) 



Hence 



Pr (N t>a > M) = Pr I sup S t ( S ) < ' JJ 



o<s<i v / 2S 2 {y,M - 1) 



We will eventually prove that as £ — ► +oo, 

^ A, 



2/-1 



a standard Brownian Motion -B(-). 



It follows from this and the independence of H^(-) ^2^j =x X(l,j) that 

lim Pr (N t a > M) = Pr j sup B(s) < A ] = T(A), 

'^+°° V0<s<l / 

which is the desired result. Thus we concentrate on proving (|74|) and (j73j) . 

Proof of (73J) . The expectation of Y%= x X(l,£) can be estimated as follows. 



Ex 



a/ 



f(at) 



> 1. 



(73) 
(74) 

(75) 



(76) 

(77) 
(78) 
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Indeed, ((77|) follows from the fact that Y2j>i fti) 1 = an< i <178)l follows from the 

assumption h(s) < 1/2 for s large, which means that /((l — a)t) = O (f(at)) = O (Vi)- 
The variance of the is 



Var (j2x(l,£) \ = S 2 (\at],t - \at]) = O (j^j « Ex 



I=x 



it » 1) 



because for s large, h(s) > and thus / is decreasing. By Chebyshev's Inequality, it follows 
that 



(l-a)t _ds 
at f 



1. 



(79) 



On the other hand, notice that if M' = Gt iQ (A) satisfies 



Ft am') 



r(l-a)t ds 
■lot JJ7) 

9 rM' ds 

Z J(l-Q)t JW 



A. 



We wish to show that 
this will follow from 



F t t0i (M) ~ A; 



(80) 



(81) 



M' 



ds 



M 



ds 



l(l-a)tf(s) 2 J(l-a)t f(s) 2 ' 

Since M = [M'\ , M < M' < M + 1 and 



ds 



(l-a)t f( s ) 2 J(l-a)t f(s) 



M 



ds 



< 



f(M 



A2 ' 



This follows from the fact that h(s) > (and thus / is increasing) for s large enough. On 
the other hand, we must have M' — (1 — a)t 3> 1, as for any constant C 



(i-*)t+c ds 



< 



C 



l(i-a)t f(s) 2 ~ /((I " a)t) 
contradicting (jHOJ. Thus 



< 1 < 



(l-a)t 
at f(s) 



rM' 
J(l-a)t 



(l-a)t f(s) 



ds M' — (1 — a)t 



f(M 



l\2 



M 



ds 



-a)t f{sf 



M _ds_ 

(l-a)t /(*) 2 
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which proves (|81|) . In particular, it follows from ()79|) that 



J(l-a)i JW 

Finally, since by l|78|l the numerator of 



M ds (82) 



r(la)t 



9 fM ds 



diverges, we deduce that J(^_ a ^ t jfjpfz 3> 1, which implies that 

r M ds 

J(i-a)t f{sr 

Plugging this into (jHU) yields ((75|) . 



Proof of J7^| ). We have already shown above that S 2 (y,M — 1) ~ /(i_ a )t yra? ^ 1- 
Since / is bounded below, 

S 3 (y, M - 1) = O (S 2 (y, M - 1)) « S 2 (y, M - if' 2 . 

Thus the condition for the Invariance Principlie in Section f5. 21 is satisfied, and this finishes 
the proof. □ 



8 Extensions and open problems 

Our Theorem |f)l admits an extension to the case of a general number B > 2 of bins. 

Theorem 8 Let f be a ELM function, B > 2 and Aj € R (i € B) be constants, and assume 
that q = q(n) (n G NJ is such that 

• t/B + \i q(n) £ N for alln£~N and i £ [B]; 

• <z( n ) ~ loin) f or fi>l (qo is defined in \2iJ\) ). 

Now consider the B-bin balls-in-bins process started from initial state 

(*i(t))f=i= (l + Ai^t)) 5 GN B . 
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Then 



lim Pr 

t— »+oo 



K(t))f =1 (ELeadi) 




lim Pr 



\xi(t))(L 



(Leadi) 



Pr V2 < * < B , sup 



0<*<1 



(B 1 (t) - Bj(t)) 
V2 



< (Ai - Xi) 



where {Bi(-)}f =1 ({Ni}f =1 ) are i.i.d. standard Brownian Motions (resp. Gaussians). 

The proof of this result, which we omit, follows essentially the same lines as that of 
Theorem |HJ The only major differences is that the Invariance Principle is applied to the 
sequences 



and some extra care must be taken in looking at the maxima of the corresponding random 
continuous functions. 

It is not entirely obvious how Theorem [7| on the almost balanced regime should or could 
be generalized to more than 2 bins. All we can say in general is that the addition of bins to 
the process will always make any overtaking take longer (to see this, notice that more bins 
just add more arrivals in the continuous-time exponential embedding between the start and 
overtaking times). In fact, there is a more basic question about such processes that we 
cannot answer; it was posed as a conjecture by Joel Spencer. 

Conjecture 1 (Joel Spencer) Consider a balls-in-bins process (Im(i))ie[B],m>o with feed- 
back function f that is in the almost-balanced regime (cf. Theorem^). Then for all per- 
mutations II of [B] and all initial conditions, with probability 1 there are infinitely many 
m > with I m (n(l)) < 7 m (II(2)) < • • • < I m (TL(B)). That is, all permutations possible of 
the bins occur infinitely often almost surely. 

Perhaps the techniques presented here could be used to settle this problem. 
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